NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Disentangling Extraction and Reasoning in Multi-hop Spatial Reasoning

https://doi.org/10.18653/v1/2023.findings-emnlp.221

Mirzaee, Roshanak; Kordjamshidi, Parisa (January 2023, Association for Computational Linguistics)

Spatial reasoning over text is challenging as the models not only need to extract the direct spatial information from the text but also reason over those and infer implicit spatial relations. Recent studies highlight the struggles even large language models encounter when it comes to performing spatial reasoning over text. In this paper, we explore the potential benefits of disentangling the processes of information extraction and reasoning in models to address this challenge. To explore this, we design various models that disentangle extraction and reasoning(either symbolic or neural) and compare them with state-of-the-art(SOTA) baselines with no explicit design for these parts. Our experimental results consistently demonstrate the efficacy of disentangling, showcasing its ability to enhance models{'} generalizability within realistic data domains.
more » « less
Full Text Available
Transfer Learning with Synthetic Corpora for Spatial Role Labeling and Reasoning

Mirzaee, Roshanak; Kordjamshidi, Parisa (December 2022, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing)

Recent research shows synthetic data as a source of supervision helps pretrained language models (PLM) transfer learning to new target tasks/domains. However, this idea is less explored for spatial language. We provide two new data resources on multiple spatial language processing tasks. The first dataset is synthesized for transfer learning on spatial question answering (SQA) and spatial role labeling (SpRL). Compared to previous SQA datasets, we include a larger variety of spatial relation types and spatial expressions. Our data generation process is easily extendable with new spatial expression lexicons. The second one is a real-world SQA dataset with human-generated questions built on an existing corpus with SPRL annotations. This dataset can be used to evaluate spatial language processing models in realistic situations. We show pretraining with automatically generated data significantly improves the SOTA results on several SQA and SPRL benchmarks, particularly when the training data in the target domain is small.
more » « less
Full Text Available
GLUECons: A Generic Benchmark for Learning Under Constraints

https://doi.org/10.1609/aaai.v37i8.26143

Rajaby Faghihi, Hossein; Nafar, Aliakbar; Zheng, Chen; Mirzaee, Roshanak; Zhang, Yue; Uszok Andrzej; Wan, Alexander; Premsri, Tanawan; Roth, Dan; Kordjamshidi, Parisa (July 2023, The 37th Conference of Artificial Intelligence (AAAI-2023))

Recent research has shown that integrating domain knowledge into deep learning architectures is effective – it helps reduce the amount of required data, improves the accuracy of the models’ decisions, and improves the interpretability of models. However, the research community is missing a convened benchmark for systematically evaluating knowledge integration methods. In this work, we create a benchmark that is a collection of nine tasks in the domains of natural language processing and computer vision. In all cases, we model external knowledge as constraints, specify the sources of the constraints for each task, and implement various models that use these constraints. We report the results of these models using a new set of extended evaluation criteria in addition to the task performances for a more in-depth analysis. This effort provides a framework for a more comprehensive and systematic comparison of constraint integration techniques and for identifying related research challenges. It will facilitate further research for alleviating some problems of state-of-the-art neural models.
more » « less
Full Text Available

Search for: All records